Search CORE

arXiv.org e-Print Archive

Longest property-preserved common factor

Author: D Belazzougui
D Gusfield
H Bannai
J-P Duval
L Chi
M Dumitran
M Farach
M Federico
M Lothaire
P Peterlongo
P Peterlongo
S Inenaga
SR Chowdhury
SV Thankachan
SV Thankachan
SW Bae
T Kociumaka
T Starikovskaya
WI Chang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

In this paper we introduce a new family of string processing problems. We are given two or more strings and we are asked to compute a factor common to all strings that preserves a specific property and has maximal length. Here we consider two fundamental string properties: square-free factors and periodic factors under two different settings, one per property. In the first setting, we are given a string x and we are asked to construct a data structure over x answering the following type of on-line queries: given string y, find a longest square-free factor common to x and y. In the second setting, we are given k strings and an integer 1 < k’ ≤ k and we are asked to find a longest periodic factor common to at least k’ strings. We present linear-time solutions for both settings. We anticipate that our paradigm can be extended to other string properties

Archivio istituzionale della ricerca - Università di Trieste

INRIA a CCSD electronic archive server

Archivio della Ricerca - Università di Pisa

King's Research Portal

Association of Mitochondrial DNA Variations with Lung Cancer Risk in a Han Chinese Population from Southwestern China

Mitochondrial DNA (mtDNA) is particularly susceptible to oxidative damage and mutation due to the high rate of reactive oxygen species (ROS) production and limited DNA-repair capacity in mitochondrial. Previous studies demonstrated that the increased mtDNA copy number for compensation for damage, which was associated with cigarette smoking, has been found to be associated with lung cancer risk among heavy smokers. Given that the common and “non-pathological” mtDNA variations determine differences in oxidative phosphorylation performance and ROS production, an important determinant of lung cancer risk, we hypothesize that the mtDNA variations may play roles in lung cancer risk. To test this hypothesis, we conducted a case-control study to compare the frequencies of mtDNA haplogroups and an 822 bp mtDNA deletion between 422 lung cancer patients and 504 controls. Multivariate logistic regression analysis revealed that haplogroups D and F were related to individual lung cancer resistance (OR = 0.465, 95%CI = 0.329–0.656, p<0.001; and OR = 0.622, 95%CI = 0.425–0.909, p = 0.014, respectively), while haplogroups G and M7 might be risk factors for lung cancer (OR = 3.924, 95%CI = 1.757–6.689, p<0.001; and OR = 2.037, 95%CI = 1.253–3.312, p = 0.004, respectively). Additionally, multivariate logistic regression analysis revealed that cigarette smoking was a risk factor for the 822 bp mtDNA deletion. Furthermore, the increased frequencies of the mtDNA deletion in male cigarette smoking subjects of combined cases and controls with haplogroup D indicated that the haplogroup D might be susceptible to DNA damage from external ROS caused by heavy cigarette smoking

CiteSeerX

Origin and Post-Glacial Dispersal of Mitochondrial DNA Haplogroups C and D in Northern Asia

Author: A Achilli
A Achilli
A Chandrasekar
A Hartmann
A Kitchen
A Torroni
A Torroni
AP Okladnikov
B Egyed
B Malyarchuk
B Malyarchuk
B Malyarchuk
B Malyarchuk
BA Malyarchuk
BA Malyarchuk
BA Malyarchuk
BA Malyarchuk
Boris Malyarchuk
C Nohira
D Comas
D Mishmar
D Mishmar
E Tamm
EI Kushnerevich
Galina Denisova
H Ueno
HJ Bandelt
Ilia Zakharov
Irina Dambueva
J Burger
J Saillard
K Tambets
KA Tabbada
KB Schroeder
M Derenko
M Ingman
M Ingman
M Metspalu
M Metspalu
M Rasmussen
M Richards
M Tanaka
M van Oven
M. Thomas P Gilbert
Maria Perkova
Miroslava Derenko
MT Gilbert
MV Derenko
MV Derenko
MV Derenko
MV Derenko
NV Volodko
OA Derbeneva
OA Derbeneva
P Soares
P Soares
QP Kong
QP Kong
RM Andrews
RS Malhi
SA Laukhin
SA Vasiliev
T Goebel
T Goebel
T Grzybowski
Tomasz Grzybowski
UA Perego
UA Perego
Urszula Rogalla
V Macaulay
VV Pitulko
YB Starikovskaya
Z Qin
Publication venue: Public Library of Science
Publication date: 21/12/2010
Field of study

More than a half of the northern Asian pool of human mitochondrial DNA (mtDNA) is fragmented into a number of subclades of haplogroups C and D, two of the most frequent haplogroups throughout northern, eastern, central Asia and America. While there has been considerable recent progress in studying mitochondrial variation in eastern Asia and America at the complete genome resolution, little comparable data is available for regions such as southern Siberia – the area where most of northern Asian haplogroups, including C and D, likely diversified. This gap in our knowledge causes a serious barrier for progress in understanding the demographic pre-history of northern Eurasia in general. Here we describe the phylogeography of haplogroups C and D in the populations of northern and eastern Asia. We have analyzed 770 samples from haplogroups C and D (174 and 596, respectively) at high resolution, including 182 novel complete mtDNA sequences representing haplogroups C and D (83 and 99, respectively). The present-day variation of haplogroups C and D suggests that these mtDNA clades expanded before the Last Glacial Maximum (LGM), with their oldest lineages being present in the eastern Asia. Unlike in eastern Asia, most of the northern Asian variants of haplogroups C and D began the expansion after the LGM, thus pointing to post-glacial re-colonization of northern Asia. Our results show that both haplogroups were involved in migrations, from eastern Asia and southern Siberia to eastern and northeastern Europe, likely during the middle Holocene

The Peopling of Korea Revealed by Analyses of Mitochondrial DNA and Y-Chromosomal Markers

Author: A Brandstätter
A Torroni
AR Marrero
B Richards
B Su
B Wen
B Wen
CG Turner
Chris Tyler-Smith
DC Wallace
EB Starikovskaya
GT Powell
H Li
H Shi
Han-Jun Jin
HJ Bandelt
HJ Bandelt
HJ Bandelt
HJ Jin
HJ Jin
HW Goedde
HY Lee
I Dupanloup
J Sambrook
J Yan
JH Hwang
JY Chu
K Kim
KD Kwak
L Excoffier
L Quintana-Murci
LL Cavalli-Sforza
LL Cavalli-Sforza
M Hara
M Nei
M Tanaka
MA Jobling
Mark A. Batzer
MF Hammer
ML Choi
MV Derenko
MV Derenko
N Saha
PA Underhill
PA Underhill
PA Zalloua
Q Zhang
QP Kong
R Qamar
RE Giles
RM Andrews
RR Sokal
RY Yong
S Maruyama
S Schneider
S Wells
SB Hong
SS Hong
T Karafet
T Kivisild
TY Huang
W Kim
Wook Kim
Y Xue
Y Zhang
YG Yao
YG Yao
YG Yao
YH Han
YJ Zhang
Publication venue: Public Library of Science
Publication date: 16/01/2009
Field of study

The Koreans are generally considered a northeast Asian group because of their geographical location. However, recent findings from Y chromosome studies showed that the Korean population contains lineages from both southern and northern parts of East Asia. To understand the genetic history and relationships of Korea more fully, additional data and analyses are necessary.We analyzed mitochondrial DNA (mtDNA) sequence variation in the hypervariable segments I and II (HVS-I and HVS-II) and haplogroup-specific mutations in coding regions in 445 individuals from seven east Asian populations (Korean, Korean-Chinese, Mongolian, Manchurian, Han (Beijing), Vietnamese and Thais). In addition, published mtDNA haplogroup data (N = 3307), mtDNA HVS-I sequences (N = 2313), Y chromosome haplogroup data (N = 1697) and Y chromosome STR data (N = 2713) were analyzed to elucidate the genetic structure of East Asian populations. All the mtDNA profiles studied here were classified into subsets of haplogroups common in East Asia, with just two exceptions. In general, the Korean mtDNA profiles revealed similarities to other northeastern Asian populations through analysis of individual haplogroup distributions, genetic distances between populations or an analysis of molecular variance, although a minor southern contribution was also suggested. Reanalysis of Y-chromosomal data confirmed both the overall similarity to other northeastern populations, and also a larger paternal contribution from southeastern populations.The present work provides evidence that peopling of Korea can be seen as a complex process, interpreted as an early northern Asian settlement with at least one subsequent male-biased southern-to-northern migration, possibly associated with the spread of rice agriculture

Archaeological Support for the Three-Stage Expansion of Modern Humans across Northeastern Eurasia and into the Americas

Background Understanding the dynamics of the human range expansion across northeastern Eurasia during the late Pleistocene is central to establishing empirical temporal constraints on the colonization of the Americas [1]. Opinions vary widely on how and when the Americas were colonized, with advocates supporting either a pre-[2] or post-[1], [3], [4], [5], [6] last glacial maximum (LGM) colonization, via either a land bridge across Beringia [3], [4], [5], a sea-faring Pacific Rim coastal route [1], [3], a trans-Arctic route [4], or a trans-Atlantic oceanic route [5]. Here we analyze a large sample of radiocarbon dates from the northeast Eurasian Upper Paleolithic to identify the origin of this expansion, and estimate the velocity of colonization wave as it moved across northern Eurasia and into the Americas. Methodology/Principal Findings We use diffusion models [6], [7] to quantify these dynamics. Our results show the expansion originated in the Altai region of southern Siberia ~46kBP , and from there expanded across northern Eurasia at an average velocity of 0.16 km per year. However, the movement of the colonizing wave was not continuous but underwent three distinct phases: 1) an initial expansion from 47-32k calBP; 2) a hiatus from ~32-16k calBP, and 3) a second expansion after the LGM ~16k calBP. These results provide archaeological support for the recently proposed three-stage model of the colonization of the Americas [8], [9]. Our results falsify the hypothesis of a pre-LGM terrestrial colonization of the Americas and we discuss the importance of these empirical results in the light of alternative models. Conclusions/Significance Our results demonstrate that the radiocarbon record of Upper Paleolithic northeastern Eurasia supports a post-LGM terrestrial colonization of the Americas falsifying the proposed pre-LGM terrestrial colonization of the Americas. We show that this expansion was not a simple process, but proceeded in three phases, consistent with genetic data, largely in response to the variable climatic conditions of late Pleistocene northeast Eurasia. Further, the constraints imposed by the spatiotemporal gradient in the empirical radiocarbon record across this entire region suggests that North America cannot have been colonized much before the existing Clovis radiocarbon record suggests

Simon Fraser University Institutional Repository

Beringian Standstill and Spread of Native American Founders

Author: A Helgason
A Torroni
AL Non
B Pakendorf
BA Malyarchuk
BM Kemp
C Herrnstadt
C Herrnstadt
CJ Kolman
CJ Kolman
Claudio M. Bravi
Connie J. Mulligan
Cristina Martinez-Labarga
D Mishmar
DA Merriewther
DA Merriwether
David Glenn Smith
Dee Carter
EB Starikovskaya
EJ Szathmary
Elsa K. Khusnutdinova
Erika Tamm
FA Kaestle
G Bailliet
GF Shields
HJ Bandelt
J Saillard
J Wakeley
JA Eshleman
JG Lorenz
Jose E. Dipierri
KB Schroeder
Larisa Damba
Ludmila P. Ossipova
M Hasegawa
M Ingman
M Tanaka
MA Bermisheva
Maere Reidla
Mait Metspalu
Maria V. Golubenko
Marina A. Gubina
MD Brown
MH Crawford
Mikhail I. Voevoda
N Maca-Meyer
O Rickards
OA Derbeneva
Olga Rickards
P Forster
PE Melton
Q-P Kong
RH Ward
Richard Villems
Ripan S. Malhi
RS Malhi
RS Malhi
S Horai
S Sigurğardóttir
SA Fedorova
Sardana A. Fedorova
SE Santos
Sergey I. Zhadanov
SL Bonatto
T Kivisild
TD Dillehay
TG Schurr
TG Schurr
Toomas Kivisild
TV Goltsova
Vadim A. Stepanov
VV Pitulko
Publication venue: Public Library of Science
Publication date: 01/01/2007
Field of study

Native Americans derive from a small number of Asian founders who likely arrived to the Americas via Beringia. However, additional details about the intial colonization of the Americas remain unclear. To investigate the pioneering phase in the Americas we analyzed a total of 623 complete mtDNAs from the Americas and Asia, including 20 new complete mtDNAs from the Americas and seven from Asia. This sequence data was used to direct high-resolution genotyping from 20 American and 26 Asian populations. Here we describe more genetic diversity within the founder population than was previously reported. The newly resolved phylogenetic structure suggests that ancestors of Native Americans paused when they reached Beringia, during which time New World founder lineages differentiated from their Asian sister-clades. This pause in movement was followed by a swift migration southward that distributed the founder types all the way to South America. The data also suggest more recent bi-directional gene flow between Siberia and the North American Arctic

CiteSeerX